> 

A I Kfloo 76 juc ltat» 219 

February 13, J9G5 USP jj ?rojcct ^^ 2 



?fr e COH fl Fgaturg jqj-tSP^L 



DanJc* G. Bobro'j 



Xhe._C&iri_JPejtura In LISP _ a 

JL_ SlilEStt 

The purpose of the COHVX feature is to facilitate certain types of list 

raaatpulatioas in LXSP tl* This feature is a syntactic convenience, rather 
than an extension of the semantics of LISP. It pcrnita the progranwer to 
teat directly whether a piece of list seructur* matches a certain pattern, ard 
if jojto construct another structure utilizing &ub3C£Mnts of tho original 
structure which mctched parts of the given pattern* 

lI...JBttJtetfib,.jfld_Cowtritft Interpreters 

Mie CCSftT feature can bo ioplccanted by programing two interpretive 
fuuetiono in LISP, 

natch [pattern; workspace] 

.and construct [format; array) 

?he ngtch interpreter has two arguments, a pattern of the typo described 
below, and a workspace which is the list which ta to bt matched against the 
pattern* If zbe match fails, the value of the function io NIL. Tf it 
succeeds, the value is a symbolic array which gives the cegnentation of the 
woi'ksp&iGe which allowed a natch. 

The gaflg&Bttt interpreter uses a format, to be described below, and an 
array which is an output of a successful Batch. It constructs a new atructare 
according to this foraat. 



•2- 



A pattern la n Hat of elementary patterns- In the flource language, a 
<£jsttftrn^l3 of the following fora 

( <ep> <Tep> .., <cj>> ) 
vrticr* ©;c1i ^op^ J« an elementary j»atteru, 03 described twlow, A epece 
delimits ouch <ep> . Tn Internal Uaguflse It appears as 
( pAHfiW <ep>* <ep^v „. fep^A ) 

i.^er* wc h« T /c used the # to indicate the LrautfEorrafition of each elementary 
pattern to (ts internal representation- )>i general * will fce wed tc 
indicate the transformation to interne) language a* defined for all of LJSP 
IT* A pattern natcheo a Hut if each elementary pattern Pitches a segc^ut 
of th(i* list, «ed the matched segments, in order, constitute the entire Hot* 
The following are the cleracntery pattern*: 

1) Source Language s 9 
internal t^ngiwge; (ML) 

TMa uttl Mtch any sagnant of the workspace, including a null 
eegmart. If the rest of the pattern raatches the remainder of the uorte.?pscA* 
Ttws smallest possible segment which allows the nwtch lo found* The one 
exception to this rule is that $ aa a last element of a pattern will aatch 
the reusladnr of the workspace*. 

2) Source language: $ i'ltttegcrV fc-s- $1, $2, **#■, $n ? ... 
IntaxuX Languag*£ (00l» <integer>*) (DOUT 1), (MUI 2)>..- 

Por any Integer n, $n matches n consecutive alaoatita of the voritspa<:o. 
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3) Source Language ■ ^s-oxpV e«g- 'BOY, '(A B) > 'A, *THE '#A]M£ 
Internal Language; (QUOTE <Te*exp> ) 

A quoted o-e*preealon matches an identical decent in the workspace. 

4) Source language: <varieble> eg- A, AU*IIA[4] 
Internal Language : <variabie*> 4 

A variable Batches an item in the workspace vhtch la equal to the 

v.'liu- of the variable. 

5) Source Language: < pattern** 
Internal language: <pattcm>^ 

A pattern cmtchcc en ilea in the workspace which is a list, end 
vhich m/itchcg the pattern in the sense defined here. If a cietch is found 
the array Cor thia match is placed in the appropriate position of the array 
for the total oatch, 

6) Source language* $/<.pred*atoa> 
Internal Language: (DOLP <pred-atom> *) 

Katchea anything that $ matches » with the additional condition 
ttat the negffient found munt also eatisfy the predicate of one variabTo 
nacted by the atoca <pi: ?d o _ v.v 1 * 

7} Source Languages $/(tfpred>) 

rnternal Language: (DOIP <pred>*> 

Exactly the eame ac alovo except that <pred> ia any ncn*atou;ic 
predicate of one variable. 
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8) Source Language 1 Sn/ <pred-atoea> *. 3n/(<pvsd>> 

Internal Language: (DOUIP n <prod-jitOQ> *) or (DOLUP n <pred>^) 
Aft ifcove, but matches a segment of n lt*Sl which aatiofieo Che 
predicate of one variable naawi <pred-fltoai> or «^pred> - 

9) Source Language: n or n$m$p e.(;. 4 or 1$2$4 • mftthf*! l >"*f^*>h' l 'S flr> 
luternai Language- (ABHA8K n*> (ABHMK n* m* p*) 

An integer n refers to the contents of the nth eleramt of th« 
array for thl3 match* It raatches an Item in the workspace identical to 
this element In the array* 1$2 would rcfor to the second Item in the array 
stored at the first amy position. This type of<ep> la called an array 
DHlfc or <br-nuirk < >. 

10) Source Language; $//(<n-p«d> <p-arg> n- ) 
Internal Language: (DOLF ^n-prod>^V <p-arg>* " ) 

Hatches anything watched by $ with the additional condition that the 
cc^mrnt matched satisfies the predicate of n-arguments named by <Vi-proa"> . The 
first argument of <n-pred> is implicitly the Hat matched by the $, The 
other n-1 arguments ar» all <p~arg>'s. A <p^arg , > can be on <fmttrk> , a 
<varioblo^ or a '<s-*xp>< They reference previously antched itaasi 
variables a Ad constants. 



II) Source Language; $n//( <n-pted> <i>-arg> n * ) 

internal Lenguage; {DOIHP n* <B-pfO <k-mg>**" 1 ) 

Snae as above but matches a scgueaK of n consecutive items in the 
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workepace. Similarly, we will oU 



ow; 



12) Sourea language: variable //< <o- P red> O-arg} 11 " 1 ) 

Internal LU|Bl|ll (VARF <variable>& <n-prcd> * <p-arg> * n ' 1 ) 



13) Source Language: <aaxark>//< <n-prod> <p-arg> n ~ X ) 

Internal Language (ART <arnark>* *-pr«d>* <p-arg?* u " l j 



14) Source Language: 9$/<c-fcn> 

Internal Language: (DOLL <c-fcn>*) 

This is an escape raechenism to allow the user to construct hie own 
matching function, the -fc-£cn^ is a function of 4 arguments. A, H, W and 
TO. At ran time this function will be given an array A, a number H which 
represents the position of the <bp> , $3, in the pattern, a list V which 
Is the workspace that the <^-fcn> should try to natch, and a functional 
orguaient PH which should be used to match the reot of the workspace beyond 
the point nacehed by the «:-fcri> . If PH succeeds, and <c-fch? matches , 
<c-fcn> should ionert the segment it Batches into the array at position II 
and return the array. If not <c-fcn> should return NIL. 

AoBl.jnmcnt to variables of matching segments of the workspace can 
bo done autoiaatleally within a pettern for <ep>*a $, Si, and $$ In all 
their forms. This is done by preceding the ifop> by the variable, with no 
space between the variable and the following $. In internal language thla is 
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rcprooentcd by 

(SET <vari*blo>* <Wk>*) 
VlieneQUiPia fc5»e fern; of $ s $l f or $5 foliating the variable nan. 



TV* .ggrrcata 

A fonwt Is a list of elat^ntary formate <ef> . In eourcti language 
it appears ac 

(<cf> <cf> <ef> ) 

vboxe successive <cf> 'a ere separated by apace** In intenwl language 
it appears as 

(F0F2IAT <eO> <af>« .**• <<*£>*) 
The list denoted by each <of> Is concatenated into the list denoted by the 
format. If the <ef> denotes an atom, then this atom in cons-*d Into the 
Hot. Tut* elc&eiifcavy formate are; 

1) Source language; l ^e-exp> e.g. 'A f (A B) 
Internal language* (QUOTE ^o*«p> ) 

The quoted <s-exp> lo concatenated or conc-ed in to described above. 
This implies that *A and *(A) will be treated identically. 

2) Source Language: <variablo> 
Internal Language i <variabie>* 

Hm valuo of the variable is Inserted in the reconstructed Hot* 

3) Source Language t <£fi&nSYk^ e.g. 3, *$2 

Internal Language: <auark> * e.g. (ABHAHK 3) (ABHAfiK 4 2) 
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The elesient o£ the array at the politico given by th* *a*;inark> :U 
inserted Into the reconstructed List* If the ^a>^sork> specifies a position 
in Mich there is an array, the elements of this array arc concatenated, And 
ttis new Hot inserted in the roc oust ruction. 

4) Source Language: < format^ 
Internal Language: <foriaat^* 

A <for*aat> can be Inserted as an <ef> within o ^ormat*> * The 
result of reconstructing with thla Inner ^foraat*> and the origiual array 
is cccs**ed into the higher level list, thus allowing construction of 
arbitral iy conplax list structures. 

g) Source Language; fn( <p*arg> ... ^p-ari;>) 

Internal Language: (70HM ftf* <p-arg> * .... £p»tt& *) 

this {ttf} allow the insertion into the reconstruction «© a list 
which is an arbitrary function of the watched subscfiracnto, and other variablea, 
and constants* 

Suppose Vi are given any function of two variables f lu;vl for which 
ve fix a value 5 for u« It is nr.c effectively a function of one variable. Vq 
ca:\ define a function £* [v] which utilizes this constant in its definition. 
Htm f £B;v) a:id f* [v] are called, the latter will run more efficiently- 

Bott match and ^pstaict aro functions of the type discussed above. 
The pattern fox e^cJi* and the CocnAt £or cons t ru est are often known at 



compile time. In LISP II we will compile cella to match and congtrugj 
which have constant patterns and forasta, respectively, by first translating 
them into LISP functions of one variable. I have written a preliminary 
version of this translator forjggtch. 

Y % : Tho .CCjJfT gtatemont 

the COMIT statement in LISP II will provide a convenient way to call 
Mtffr an<i const ruc t implicitly. In source langtage, the Com of the most 
gcnersl CC811T statement will be: 

COMIT H, U<-< F pattom> , A <*~<£oTaat>> - - > • >D<fc-<fonnat> £(»-), PflU) 
In internal language this would be represented as 

{COMIT W, ((U<pattem>*> (A <forrat>*). . .(D <fomat7*» I., * 2 ) 

If Is a locative expression for a list which will b* the workspace 
for the pattern match. U is an array name which wilt be assigned the value 
of the natch if it succeeds. An implicit call is made to match with ths 

pattern and w as arguments. A, B, -,,D are locative expressions which are 
assigned values according to their respective formats if the natch is 
successful* If the match is successful, after assignments to A, ...,D 
control goes lo the statement labeled Q (indicated by Stf}.)). If the ratch 
fails* a transfer is made to 1- (indicated by P(S ? )). 

Many elements of this statement are optional. If "IK" is omitted* 
a local array it created but not named, iff "A**" (the first locative 
expression with a format) tc omitted, it is assimcd to be W. Ho other formats 
need appear. S(£.) and/or Fifl*) rcay he otaitted; transfer will be ©ado to 
the next ctatcnent for en omitted condition label* If either S((.) or F(^ ? ) 
tire rrioofvg, they appear in Internal language as HJL. 



